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This article is concerned with the fluctuations and the concen- 
tration properties of a general class of discrete generation and mean 
field particle interpretations of nonlinear measure valued processes. 
We combine an original stochastic perturbation analysis with a con- 
centration analysis for triangular arrays of conditionally independent 
random sequences, which may be of independent interest. Under some 
additional stability properties of the limiting measure valued pro- 
cesses, uniform concentration properties, with respect to the time 
parameter, are also derived. The concentration inequalities presented 
here generalize the classical Hoeffding, Bernstein and Bennett in- 
equalities for independent random sequences to interacting particle 
systems, yielding very new results for this class of models. 

We illustrate these results in the context of McKean-Vlasov-type 
diffusion models, McKean collision-type models of gases and of a class 
of Feynman-Kac distribution flows arising in stochastic engineering 
sciences and in molecular chemistry. 

1. Introduction. 

1.1. Mean field particle models. Let (E n ) n >o be a sequence of measur- 
able spaces equipped with some cr-nelds (£ n )n>o, and we let V{E n ) be the 
set of all probability measures over the set E n , with n > 0. We consider a 
collection of transformations <l? n +i :V(E n ) — > V(E n +i), n > 0, and we denote 
by (r] n ) n >o a sequence of probability measures on E n satisfying a nonlinear 
equation of the following form: 

(1-1) Vn+l = $n+l(Vn)- 
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These discrete time versions of conservative and nonlinear integro-differential 
type equations in distribution spaces arise in a variety of scientific disciplines 
including in physics, biology, information theory and engineering sciences. 
To motivate the article, before describing their mean field particle inter- 
pretations, we illustrate these rather abstract evolution models working out 
explicitly some of these equations in a series of concrete examples. The first 
one is related to nonlinear filtering problems arising in signal processing. 
Suppose we are given a pair signal-observation Markov chain (X n , Y n ) n >o on 
some product space x R rf2 ), with some initial distribution and Markov 
transition of the following form: 

F((X ,Y ) G d(x, y)) = r) (dx)g (x, y)X {dy), 

P((X n+ i,y n+ i) £ d(x,y)\(X n ,Y n )) = M n+1 (X n ,dx)g n+1 (x,y)\ n+ i(dy). 

In the above display, A n stands for some reference probability measures 
on M. d2 , g n is a sequence of positive functions, M n+ \ are Markov transitions 
from R dl into itself and finally rjo stands for some initial probability mea- 
sure on M. dl . For a given sequence of observations Y = y delivered by some 
sensor, the filtering problem consists of computing sequentially the flow of 
conditional distributions defined by 

rf n = Law(X n |y =Vo,---,Y n = y n ) 

and 

rjn+i = Law(X n+ i|Y = yo,---,Y n = y n ). 

These distributions satisfy a nonlinear evolution equation of the form (1.1) 
with the transformations 

$n+l(Vn)(dx') = / rj n (dx)M n+1 (x,dx') and 

(1.2) 

J Vn{dx')G n {x') 

for some collection of likelihood functions G n = g n (-,y n ). Replacing these 
functions by some ]0, lj-valued. potential function Gn 

on R dl , we obtain 

the conditional distributions of a particle absorption model X' n with free 
evolution transitions M n and killing rate (1 — G n ). More precisely, if T 
stands for the killing time of the process, we have that 

(1.3) ff n = Law (X' n \T > n) and r] n+ i = Law(X' n+1 \T > n). 

These nonabsorption conditional distributions arise in the analysis of con- 
finement processes, as well as in computational physics with the numerical 
solving of Schrodinger ground state energies (see, e.g., [11, 13, 28]). 

Another important class of measures arising in particle physics and stochas- 
tic optimization problems is the class of Boltzmann distributions, also known 
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as the Gibbs measure, defined by 

(1.4) Vn (dx) = ^e-^ v ^\(dx), 

with some reference probability measure A, some inverse temperature pa- 
rameter and some nonnegative potential energy function V on some state 
space E. The normalizing constant Z n is sometimes called "partition func- 
tion" or the "free energy." In sequential Monte Carlo methodology, as well as 
in operation research literature, these multiplicative formulae are often used 
to compute rare events probabilities, as well as the cardinality or the volume 
of some complex state spaces. Further details on these stochastic techniques 
can be found in the series of articles [5, 29, 30]. To fix the ideas we can 
consider the uniform measure on some finite set E. Surprisingly, this flow of 
measures also satisfies the above nonlinear evolution equation, as soon as rj n 
is an invariant measure of M n , for each n > 1, and the potential functions 
G n are chosen of the following form: G n = exp ((/3 n +i — Pn)V). The partition 
functions can also be computed in terms of the flow of measures (%,)o<p<n 
using the easy-to-check multiplicative formula, 

Z n = / ri P (dx)G p (x), 

0<p<n^ 

as soon as /3o = 0. In statistical mechanics literature, the above formula is 
sometimes called the Jarzynski or the Crooks equality [8, 19, 20]. Notice that 
the stochastic models discussed above remain valid if we replace e~^ n+lV by 
any collection of functions g n +i s.t. g n +i = 9n x G n = Yl 0<p<n G p , for some 
potential functions G n , with n > 0. Further details on this model, with sev- 
eral worked-out applications on concrete hidden Markov chain problems and 
Bayesian inference can be found in the article [10] dedicated to sequential 
Monte Carlo technology. All of the models discussed above can be abstracted 
in a single probabilistic model. The latter is often called a Feynman-Kac 
model, and it will be presented in some details in Section 2.1. 

The mean field-type interacting particle system associated with equa- 
tion (1.1) relies on the fact that the one-step mappings can be rewritten in 
the following form: 

(1-5) $ n+ i(r? n ) =r) n K n+ljT)n 

for some collection of Markov kernels K n+ \^ indexed by the time parameter 
n > 0, and the set of measures /i on the space E n . We already mention that 
the choice of the Markov transitions K n ^ is not unique. Several examples 
are presented in Section 2 in the context of Feynman-Kac semigroups or 
McKean-Vlasov diffusion-type models. To fix the ideas, we can choose el- 
ementary transitions K n ^(x,dx') = <& n (rj)(dx') that do not depend on the 
state variable x. 



4 



P. DEL MORAL AND E. RIO 



In the literature on mean field particle models, the transitions K n> „ are 
called a choice of McKean transitions. These models provide a natural inter- 
pretation of the flow of measures r\ n as the laws of the time inhomogeneous 
Markov chain X n with elementary transitions 

P(X n E = Kn^^-ijdi) with r? n _i = Law(A" n _i) 

and starting with some initial random variable with distribution t/q = Law(.X'o) 
The Markov chain X n can be thought of as a perfect sampling algorithm. For 
a thorough description of these discrete generation and nonlinear McKean- 
type models, we refer the reader to [9]. In the further development of the 
article, we always assume that the mappings 

{x % n )\<i<N G E n !->• K n+ l,l/N-£ N =1 S j ( X n^n+l) 

are f^^-measurable, for any n > 0, N > 1, and 1 < i < N, and any mea- 
surable subset A n+ \ C E n+ \. In this situation, the mean field particle inter- 
pretation of this nonlinear measure valued model is an -valued Markov 
chain d N) = <& m ) i<i<N, with elementary transitions defined as 

N 
i=l 

with V^-=j^J2 S d N ' j) - 

In the above displayed formula, J-^ stands for the cr-field generated by 

the random sequence (£p )o<p<ru an( i dx = dx 1 x • • • x dx N stands for an 
infinitesimal neighborhood of a point x = (x 1 , . . . , x ) £ The initial sys- 
tem consists of N independent and identically distributed random vari- 
ables with common law ijq. As usual, to simplify the presentation, when there 
is no possible confusion we suppress the parameter N, so that we write £ n 
and ^ instead of £n and ^ ■ The state components of this Markov 
chain are called particles or sometimes walkers in physics to distinguish the 
stochastic sampling model with the physical particle in molecular models. 

The rationale behind this is that i j is the empirical measure associated 
with N independent variables with distributions K n+l r) N (C n , dx), so as soon 
as 1]^ is a good approximation of r] n then, in view of (1.6), rj^ +1 should be 
a good approximation of rj n +i- Roughly speaking, this induction argument 
shows that 77^ tends to r] n , as the population size N tends to infinity. 

These stochastic particle algorithms can be thought of in various ways: 
from the physical view point, they can be seen as microscopic particle inter- 
pretations of physical nonlinear measure valued equations. From the pure 
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mathematical point of view, they can also be interpreted as natural stochas- 
tic linearizations of nonlinear evolution semigroups. From the probabilistic 
point of view, they can be interpreted as interacting recycling acceptance- 
rejection sampling techniques. In this case, they can be seen as a sequential 
and interacting importance sampling technique. 

For instance, in the context of the nonlinear filtering equation (1.2), the 
mean field particle model associated with the flow of optimal one-step pre- 
dictors rj n , with the McKean transitions K n>ri (x,dx') = $„(??) (dx'), is the 
Qj£«i) ^-valued Markov chain defined by sampling N conditionally indepen- 
dent random variables £ ra+ i = (^n+i)l<i<N, with common distribution given 

by 

/ 1 N \ N r (p \ 

\ j=l / i=l Z^=lWilUJ 

By construction, the resulting particle model is a simple genetic-type stochas- 
tic algorithm: the mutation and the selection transitions are dictated by the 
prediction and the updating transitions defined in (1.2). During the selec- 
tion transition, one updates the positions of the particles in accordance with 
the fitness likelihood functions G n . This mechanism is called the selection- 
updating transition as the more likely particles with high G7 n -potential value 
are selected for reproduction. In other words, this transition allows particles 
to give birth to some particles at the expense of light particles which die. 
The second mechanism is called the mutation-prediction transition since at 
this step each particle evolves randomly according to the transition ker- 
nels M n . Another important feature of genetic- type particle models is that 
their ancestral or their complete genealogical tree structure can be used to 
approximate the smoothing problem, including the computation of the dis- 
tribution of the signal trajectories given the observations. Further details on 
this subject can be found in [9, 13]. 

The same genetic-type particle algorithm applies for the particle absorp- 
tion model (1.3) and the Boltzmann-Gibbs model (1.4), by replacing, re- 
spectively, the likelihood functions by the nonabsorption rates G n and the 
fitness functions G n = exp ((/3 n +i — Pn)V)- 

In the reverse angle, the occupation measures of a given genetic-type 
particle mean field model converge, as the size of the population tends to 
infinity, to the solution of an evolution equation of the form (1.1), with the 
one-step transformations (1.2). These limiting models are often called the 
infinite population models. For a recent treatment on these genetic models, 
we refer the reader to [16]. 

The origins of genetic-type particle methods can be traced back in physics 
and molecular chemistry in the 1950s with the pioneering works of Harris 
and Kahn [17] and Rosenbluth and Rosenbluth [27]. During the last two 
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decades, the mean field particle interpretations of these discrete generation 
measure valued equations were increasingly identified as a powerful stochas- 
tic simulation algorithm with emerging subjects in physics, biology and en- 
gineering sciences. They have led to spectacular results in signal processing 
processing with the corresponding particle filter technology, in stochastic 
engineering with interacting-type Metropolis and Gibbs sampler methods, 
as well as in quantum chemistry with quantum and diffusion Monte Carlo 
algorithms leading to precise estimates of the top eigenvalues and the ground 
states of Schroedinger operators. For a thorough discussion on these appli- 
cation areas, we refer the reader to [9, 10, 15], and the references therein. 
To motivate the article, we illustrate the fluctuation and the concentration 
results presented in this work with three additional illustrative examples, 
including Feynman-Kac models, McKean-Vlasov diffusion-type models, as 
well as interacting jump type McKean model of gases. 

We end this Introduction with some more or less traditional notation 
used in the present article. We denote, respectively, by M(E), Ai$(E) and 
13(E), the set of all finite signed measures on some measurable space (E,£), 
the convex subset of measures with null mass and the Banach space of all 
bounded and measurable functions / equipped with the uniform norm ||/||. 
We also denote by Osci(2£), the convex set of £ -measurable functions / 
with oscillations osc(/) < 1. We let //(/) = / fj,(dx) f (x) , be the Lebesgue 
integral of a function / £ 13(E), with respect to a measure [/, £ M.(E). We 
recall that a bounded integral operator M from a measurable space (E, £) 
into an auxiliary measurable space (F,F) is an operator / i— > M(f) from 
13(F) into B(E) such that the functions M(f)(x) := f F M(x,dy)f(y) are 
^-measurable and bounded, for any / £ B(F). A Markov kernel is a pos- 
itive and bounded integral operator M with M(l) = 1. Given a pair of 
bounded integral operators (Mi, M2), we let (M1M2) the composition opera- 
tor defined by (M1M2X/) = Mi(M2(/)). For time homogenous state spaces, 
we denote by M m = M m_1 M = MM™" 1 the mth composition of a given 
bounded integral operator M, with m > 1. A bounded integral operator M 
from a measurable space (E,£) into an auxiliary measurable space (F,T) 
also generates a dual operator /u 1 — >- fiM from M(E) into Ai(F) defined by 
(fJ,M)(f) := n(M(f)). We also used the notation 

K([f - K(f)) 2 )(x) := K([f - K(f)(x)) 2 )(x) 

for some bounded integral operator K and some bounded function /. 

When the bounded integral operator M has a constant mass, that is, 
when M(l)(x) = M(l)(y) for any (x,y) £ E 2 , the operator \i 1— > /j,M maps 
Mq(E) into Mo(F). In this situation, we let /3(M) be the Dobrushin coef- 
ficient of a bounded integral operator M defined by the formula /3(M) := 
sup{osc(M(/));/£Osci(F)}. 
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1.2. Description of the main results. The mathematical and numerical 
analysis of the mean field particle models (1.6) is one of the most attrac- 
tive research areas in pure and applied probability, as well as in advanced 
stochastic engineering and computational physics. 

The fluctuation analysis of these discrete generation particle models around 
their limiting distributions is often restricted to Feynman-Kac-type models 
(see, e.g., [7, 9, 12, 14] and references therein) or specific continuous time 
mean field models including McKean-Vlasov diffusions and Boltzmann-type 
collision models of gases [24, 31]. 

In the present article, we design an original and natural stochastic per- 
turbation analysis that applies to a rather large class of models satisfying a 
rather weak first-order regularity property. We combine an original stochas- 
tic perturbation analysis with a concentration analysis for triangular arrays 
of conditionally independent random sequences, which may be of indepen- 
dent interest. Under some additional stability properties of the limiting mea- 
sure valued processes, uniform concentration properties with respect to the 
time parameter are also derived. The concentration inequalities presented 
here generalize the classical Hoeffding, Bernstein and Bennett inequalities 
for independent random sequences to interacting particle systems, yielding 
very new results for this class of models. 

To describe with some precision this first main result we observe that the 
local sampling errors associated with the corresponding mean field particle 
model are expressed in terms of the centered random fields , given by 
the following stochastic perturbation formulae: 

(1-8) Vn = Vn-lK,^ + 4^ W n- 

To analyze the propagation properties of these local sampling errors, up 
to a second-order remainder measure, we further assume that the one-step 
mappings governing equation (1.1) have a first-order decomposition 

(1.9) *„fa) - $ n (M) ~ (r? - n)D^ n 

with a first-order integral operator from B(E n ) into B{E n ^i), s.t. 

L)^$ n (l) = 0. The precise definition of the first-order regularity property (1.9) 
is provided in Definition 3.1. 

Our first main result is a functional central limit theorem for the random 
fields 

(1.10) V n N :=VN[^- Vn ]. 

This fluctuation theorem takes, basically, the following form. 

Theorem 1.1. 

• The sequence (W^) n >o converges in law, as N tends to infinity, to the 
sequence of n independent, Gaussian and centered random fields (W n ) n >o 
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with a covariance function given for any f,g£ B{E n ), and any n > 1, by 

(1.11) E(W n (f)W n (g)) = Vn-lKn^dlf ~ Kn^-AMg - Kn^M}) 

and, for n = 0, by 

E(Wo(/)Wo( 5 )) = ml(f - vo(f))(g - vo(g))]. 

• For any fixed time horizon n > 0, the sequence of random fields Vj^ con- 
verges in law, as the number of particles N tends to infinity, to a Gaussian 
and centered random fields 

n 
p=0 

In the above display, T> p ^ n stands for the semigroup associated with the 
operator V n = A^Sv 

A complete detailed proof of the functional central limit theorem stated 
above is provided in Section 5, dedicated to a stochastic perturbation anal- 
ysis of mean field particle models. 

We let $ Pj n = $p+i,n ° ^Vfi> < p < n, be the semigroup associated with 
the measure valued equation defined in (1.1). For p = n, we use the conven- 
tion $ njn = Id, the identity operator. By construction, we have 

&p,n(v) ~ - (V ~ f^D^p,n and A, p $ p ,n = T> p>n . 

The fluctuation theorem stated above shows that the fluctuations of rj^ 
around the limiting measure r\ n is precisely dictated by first-order differential- 
type operators D Vp & p ^ n of the semigroup around the flow of measures 
rj p , with p<n. Furthermore, for any f n £ Osci(-E n ), one observes that 

n 

nvnUn?) =Y,n{w P [D Vp %, n {f n )\) 2 ) 

p=0 



(1.12) 



n 

<^^ (jD ^ n) 2 := -2 
p=0 



with the uniform local variance parameters 

al:= sup sup \n{K n ^[f n - K n ^{f n )] 2 )\ 

/„eOsci(£; n )^eP(£„-i) 

and 

P(D$ Pin ) := sup (3(D v $ Ptn ). 

r,dV{E p ) 

The second part of this article is concerned with the concentration prop- 
erties of mean field particle models. These results quantify exponentially 
small probabilities of deviations events between the occupation measures 
r)n and their limiting values. The exponential deviation events discussed in 
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this article are described in terms of the parameters 

n 

a 2 n <ffi:=Y,KD$ P! n) 2 and 6*:= sup /9(£>* p , n ). 
P =o 

Besides the fact that the nonasymptotic analysis of weakly dependent vari- 
ables is rather well developed, the concentration properties of discrete gen- 
eration and interacting particle systems often resume to asymptotic large 
deviation results, or to nonasymptotic rough exponential estimates (see, 
e.g., [9] and references therein). Our main result on this subject is an orig- 
inal concentration theorem that includes Hoeffding, Bennett and Bernstein 
exponential inequalities for mean field particle models. This result takes, 
basically, the following form. 

Theorem 1.2. For any N > 1, n > 0, f n £ Osci(E n ), and any x>0 
the probability of each of the following pair of events is greater than 1 — e~ x 



yN 



and 



(/„) < -^=(1 + eZ \x)) + >/N3*fce? 



V n N (f n ) < "7=(1 + % + 



In the above display, r n stands for some parameter whose values only depend 
on the amplitude of the second-order terms in the development (1.9), and 
the pair of functions (eo,£i) are defined by 

(1.13) eo(A) = |(A-log(l + A)) J £i(A) = (1 + A) log (1 + A) — A. 

Under additional stability properties of the semigroup associated with the 
limiting model (1.5), the parameters (a n , /3 n ,6*,r n ) are uniformly bounded 
w.r.t. the time parameter. 

A complete detailed proof of the functional concentration inequalities 
stated in Theorem 1.2 is provided in Section 5.3. Some of the consequences 
of the concentration inequalities stated above are provided in Section 4. To 
give a flavor of these results, using a Bernstein- type concentration inequality 
we will check that 

AT/ » ■» - r n, > \ - ^ 2 



(1.14) hmsuplogP C(/ n )>-|= + A <- 

The detailed proof of this asymptotic estimate is provided on page 16. This 
observation shows that this concentration inequality is "almost" asymptot- 
ically sharp, with a variance-type term whose values are pretty close to 
the exact limiting variances presented in (1.12). A more precise asymptotic 
estimate would require a refined moderate deviation analysis. We hope to 
discuss these properties in a forthcoming study. 



10 



P. DEL MORAL AND E. RIO 



The outline of the rest of the article is as follows. To motivate the present 
article, we have collected in Section 2 three different classes of abstract 
mean field particle models that can be studied using the fluctuation and the 
concentration analysis developed in this article. 

In Section 3, we discuss the main regularity properties used in our anal- 
ysis. In Section 4, we illustrate the impact of Theorem 1.2 with some more 
Bennett and Hoeff ding- type concentration properties, as well as Bernstein- 
type concentration inequalities and uniform exponential deviation properties 
w.r.t. the time parameter. Section 5 is mainly concerned with the detailed 
proofs of the theorems stated above. We combine a natural stochastic per- 
turbation analysis with nonlinear semigroup techniques that allow us to 
describe both the fluctuations and the concentration of the mean field mea- 
sures in terms of the local error random field models introduced in (1.8). The 
functional central limit theorem is proved in Section 5.1. In Appendix A. 6, 
we provide a preliminary convex analysis including estimates of inverses 
of Legendre-Fenchel transformations of classical convex functions needed 
in this article. In Section 5.2, we prove a technical concentration lemma 
for triangular arrays of conditionally independent random variables. In Sec- 
tion 5.3, we apply this lemma to prove concentration inequalities for mean 
field models. 

2. Some illustrative examples. 

2.1. Feynman-Kac models. As mentioned in the Introduction, the first 
prototype model we have in mind is a class of Feynman-Kac distribution 
flow equation arising in a variety of application areas including stochastic 
engineering, physics, biology and Bayesian statistics. These models are de- 
fined in terms of a series of bounded and positive integral operators Q n from 
E n —i into E n with the following dynamical equation: 

(2.1) V/„ G B{E n ) Vn (fn) = 7?n-l(Qn(/n))A?n-l(Qn(l)) 

with a given initial distribution r/o £ V(Eq). To avoid unnecessary technical 
discussions we simplify the analysis and we assume that 

Vn > 0< inf G n {x) < supG n (x)<oo with G n {x) := Q n+ \{l){x). 
Rewritten in a slightly different way, we have 

Vn = ®n(Vn-l) ■= *„_i(7fo_i)M n with M n (f n ) = Q n (fn)/Qn(l) 
and the Boltzmann-Gibbs transformation $ n from V(E n ) into itself given 

by 

V/ n G B(E n ) * n (r/n)(/n) = Vn(G n f n )/Vn(G n ). 

Using the ratio formulation (2.1) of the semigroup, we will check in Ap- 
pendix A. 3 that the first-order decomposition (1.9) is met with the first- 
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order operator defined by 

The nonlinear filtering model (1.2), the particle absorption model (1.3) 
and the Boltzmann-Gibbs distribution flow (1.4) can be abstracted in this 
framework by setting 

Q n+ i(x,dx') = G n (x)M n+1 (x,dx'). 

We leave the reader to check that this flow of measures satisfy the recursive 
equation (1.1) for any choice of Markov transitions given below: 

(2.2) K n+ltVn (x,dy) = e n G n {x)M n (x,dy) + (1 - e n G n {x))^ n+1 (r] n ){dy). 

In the above displayed formula e n stands for some [0, l]-valued parameters 
that may depend on the current measure rj n and such that HffnGVJI < 1. In 
this situation, the mean field iV-particle model associated with the collection 
of Markov transitions (2.2) is a combination of simple selection/mutation 

— — . 

genetic transition £ n ~* £ n = (£^)i<?,<tv Cn+i- During the selection stage, 
with probability E n G n (£ l n ), we set ^ n = ; otherwise, the particle jumps to a 
new location, randomly drawn from the discrete distribution ^ n (r]^). During 
the mutation stage, each of the selected particles ~» Cn+i evolves according 
to the transition M n+ \. If we set e n = 0, the above particle model reduces 
to the simple genetic-type model discussed in (1.7) in the Introduction. 

2.2. Gaussian mean field models. The concentration analysis presented 
in this article is not restricted to Feynman-Kac-type models. It also ap- 
plies to McKean-type models associated with a collection of multivariate 
Gaussian- type Markov transitions on E n = M. d , defined by 

K n „(x, dy) = — . 
" M V(2^det(Q„) 
(2-3) i 

x expl --{y - d n (x,ri))'Q~ 1 (y - d n (x,r])) idy, 

with a nonsingular, positive and semi-definite covariance matrix Q n and 
some sufficiently regular drift mapping d n : (x, 7]) e R d x V(R d ) ^ d(x, ?]) G 
M, d . In Appendix A. 5, for d = 1 we will check that any linear drift function 
d n of the form d n (x,rj) = a n (x) + rj(b n )c n (x), with some measurable (and 
nonnecessarily bounded) function a n , and some pair of functions b n and 
c n £ £>(M), the first-order decomposition (1.9) is met with the first-order 
operator defined by 

D^ n {f){x) := [#„,„(/)(*) - *„(A*)(/)] 

+ b n {x) / fi(dy)c n (y)K n ^(y,dz)f(z)(z-d n (y,fi)). 
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In this context, the iV-mean field particle model is given by the following 
recursion: 

vi<i<iv i i n = d n {i i n _ l ^_ l ) + wi 

where (W^)i>o is a collection of independent and identically distributed 
d- valued Gaussian random variables with covariance matrix Q n . The con- 
nection between these discrete generation models and the more traditional 
continuous time McKean- Vlasov diffusion models is as follows. Consider the 
partial differential equation 

^ d d 

i,j=l i=l 

where fit is a probability measure on M. d , and bt some drift term associated 
with some interaction kernels b' t and given by 

b t (x,fi) = J b' t (x,x')fi(dx'). 

Under appropriate regularity conditions, one can show that rjt is the marginal 
distribution at time t of the law of the solution of the nonlinear stochastic 
differential equation 

(2.4) dX t = b t (Xt,Ht)dt + dB t , 

where Bt is a d-dimensional Brownian motion. These models have been intro- 
duced in the late- 1960s by McKean [23]. The convergence of the mean field 
particle model associated with the diffusion (2.4) has been deeply studied 
in the mid-1990s by Bossy and Talay [3, 4], Meleard [24] and Sznitman [32]. 
We also refer the reader to the more recent treatments on McKean- Vlasov 
diffusion models by Bolley, Guillin and Malrieu [1] and Bolley, Guillin and 
Villani [2]. Besides the fact that these continuous time probabilistic mod- 
els are directly connected to a rather large class of physical equations, to 
get some computationally feasible solution, some kind of time discretiza- 
tion scheme is needed. Mimicking traditional time discretization techniques 
of deterministic dynamical systems, several natural strategies can be used. 
For instance, we can use a Euler-type discretization of the diffusion given 
by (2.4) as follows: 

on the time mesh (t n )n>0) with (t n — t n -±) = A, with some initial ran- 
dom variable with distribution fiQ = Law(X^). In this situation, the el- 
ementary transitions of the approximated random states are of the 
form (2.3), with the identity covariance matrices Q n = Id, and the drift 
functions d n (x, rj) = x + bt n _ x (x, rf) . The refined convergence analysis of these 
discrete time approximation models for more general models, including gran- 
ular media equations is developed Malrieu and Talay [21, 22]. 
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We mention that the semigroup derivation approach for functional fluc- 
tuation theorems and concentration inequalities developed in this article do 
not apply directly to any nonlinear diffusion equations with general interac- 
tion kernels q. The semigroup derivation technique requires one to control 
recursively time the integrability properties of the semigroup associated with 
the first and second-order derivative terms. A rather crude sufficient con- 
dition is to assume that the drift terms d n is of the form discussed in the 
discrete time model. 

2.3. A McKean model of gases. We end this section with a mean field 
particle model arising in fluid mechanics. We consider a measurable state 
space (S n ,S n ) with a count ably generated cr-field and an (<S n <g)£ n )-measurable 
mapping a n be a from (S n x E n ) into M+ such that J v n (ds)a n (s,x) = 1, for 
any x £ E n , and some bounded positive measure u n € M.(S n ). To illustrate 
this model, we can take a partition of the state E n = UseS„ ^ s associated 
with a countable set S n equipped with the counting measure v n (s) = 1 and 
set a n (s,x) = 1a s (x). We let K n+ i tT) be the McKean transition defined by 



In the above displayed formula, M n stands for some Markov transition from 
(S n x E n ) into E n+ \. The discrete time version of McKean's two- velocities 
model for Maxwellian gases corresponds to the time homogenous model on 
E n = S n = {—1,-|-1} associated with the counting measure v n and the pair 
of parameters 



In this situation, the measure valued equation (1.1) takes the following 
quadratic form: 



We leave the reader to write out the mean field particle interpretation of this 
model. For more details on this model, we refer to [31]. In Appendix A. 4, we 
will check that the first-order decomposition (1.9) is met with the first-order 
operator defined by 



(2.5) 




a n (s,x) = l s (x) and M n+1 ((s,x),dy) = S sx (dy). 



Vn+ l(+l) = Vn (+l) 2 + (l- Vn ( + l)) 2 . 



D^ n+1 (f)(x) = [K n+1 ^(f)(x) - $ n+ i (//)(/)] 



+ / v n (ds)[a(s 7 x) - fx{a(s,-))]n(M n+1 (f)(s,-)). 



3. Some weak regularity properties. To describe precisely the concen- 
tration inequalities developed in the article, we need to introduce a first 
round of notation. 
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Definition 3.1. We let T(E,F) be the set of mappings <I> :\i £ V(E) \-t 
£ V(F) satisfying the first-order decomposition 

(3.1) - $(r?) = (/i - T})Dr,$ + ft *( M> t/). 

In the above displayed formula, the first-order operators (T^r]^) v ^v(E) 1S 
some collection of bounded integral operators from E into F such that 

Vr]£V(E),Vx£E (jD„$)(1)(x)=0 and 

(3.2) 

0(V$):= sup f3(D v &) < oo. 

The collection of second-order remainder signed measures (7£*(/i, v))(p,r])£V(E 2 ) 
on F are such that 

(3.3) < | |(m-i/)® 2 (»)|12*(/,^), 

for some collection of integral operators i?J from S(.F) into the set Osci(-E) 2 
such that 

sup / osc(g 1 )osc(g 2 )R^(f,d(g 1 0g 2 ))<osc(f)5(R iS ') 

(3.4) 

with 6(R*) < oo. 

This rather weak first-order regularity property is satisfied for a large class 
of one-step transformations <3? n associated with a nonlinear measure valued 
process (1.1). For instance, in Section 5.3 we shall prove that the Feynman- 
Kac transformations 3> n introduced in (2.1) belong to the set T(E n -i,E n ). 
The latter is also met for the Gaussian transitions introduced in (2.3) and for 
the McKean-type model of gases (2.5) presented in Section 2.3. The proof 
of this assertion is rather technical and it is postponed in Appendix A. 4. 

We assume that the one-step mappings 

^:f»£ V{E n ^) — > * n (/i) := fiK n4l £ V(E n ) 

governing equation (1.1) are chosen so that 3> n £ Y(E n -i,E n ), for any n > 1. 
The main advantage of the regularity condition comes from the fact that 
^P,n £ ^(Ep^En) with the first-order decomposition-type formula 

for some collection of bounded integral operators D^pn from E p into E n 
and some second-order remainder signed measures TZ^ v ' n (77, /i). For further 
use, we let r n be the second-order stochastic perturbation term related to 
the quadratic remainder measures R® p < n and defined by 

n 
p=0 
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4. Some exponential concentration inequalities. Let us examine some 
more or less direct consequences of the concentration inequalities stated in 
Theorem 1.2. 

When the Markov kernels K n ^ = K n do not depend on the measure fx, 
the iV-particle model reduces to a collection of independent copies of the 
Markov chain with elementary transitions P n = K n . In this special case, the 
second-order parameters vanish (i.e., r n = 0), while the first-order expansion 
parameters (a n ,f3 n ) are related to the mixing properties of the semigroup of 
the underlying Markov chain; that is, we have that 

n n 

W n = Yl a l^ P P^f <Pl = Yl ^ P V,nf With P p , n = K p+1 , K n -iK n , 

p=0 p=0 

with the Dobrushin ergodic coefficient f3(P Ptn ) associated with P pn . When 
the chain is asymptotically stable in the sense that sup n>0 Y^p=Q @{Pp,n) < 
oo, the first-order expansion parameters given above are uniformly bounded 
with respect to the time parameter. 

In more general situations, the analysis of these parameters depends on 
the model at hand. For instance, for time homogeneous Feynman-Kac mod- 
els [i.e., E n = E and (G n ,M n ) = (G,M)] these parameters can be related to 
the mixing properties of the Markov chain associated with the transitions M. 
To be more precise, let us suppose that the following condition is met: 

(M) m 3m>l,3e m >0 s.t. 

(4.1) 

V(x,y)eE 2 M m (x,-)>e m M m (y,-). 

It is well known that the mixing-type condition (M) m is satisfied for any 
aperiodic and irreducible Markov chains on finite spaces, as well as for bi- 
Laplace exponential transitions associated with a bounded drift function and 
for Gaussian transitions with a mean drift function that is constant outside 
some compact domain. To go one step further, we introduce the following 
quantities: 

(4.2) 5 m :=sup J] (G{x p )/G{y p )). 

0<p<m 

In the above displayed formula, the supremum is taken over all admissible 
pair of paths with elementary transitions M . In this situation, we can check 
that 

r n < 4ro 3i i (m) , 6* < 25 m /e m 

as well as 

a 2 n < Aw2 t 2{fn)a 2 and j3 2 n < 4n7 2 ,2("i) 

with the uniform local variance parameter a 2 and a collection of parameters 
w k,l( m ) such that ro fc i (m) < m<5 m _i5^/e^+ 2 . The detailed proof of these 
estimates can be found in Appendix A. 3. 
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As we mentioned above, in the special case where the Markov kernels 
K njfi = K n do not depend on the measure /i, the random measures rj^ co- 
incide with the occupation measure associated with N independent and 
identically distributed random variables with common law n„. In this situ- 
ation, the pair of events described in Theorem 1.2 resumes to the following 
Bennett and Hoeffding-type concentration events, respectively, given by 

[^-^](/n)<^r X (^) and [^- Vn ](f n )<^l3 n . 

The first inequality can be described more explicitly using the analytic es- 
timates 



1 V ' ~ log(l + (x/3) + sT&) ~ 

In the context of Feynman-Kac models, the second-order terms can be 
estimated more explicitly using the upper bounds 

i , s / log(l + 2x + 2Jx) - 2Jx r- 

e Hx) <2x + log 1 + 2x + 2^ + SV n n V — ^ 2x + 2 v^- 

2x + x 

A detailed proof of the upper bounds given above is detailed in Appendix A. 6, 
dedicated to the convex analysis of the Legendre-Fenchel transformations 
used in this article. The second rough estimate in the r.h.s. of the above 
displayed formulae leads to Bernstein- type concentration inequalities. 

Corollary 4.1. For any N > 1 and any n > 0, we have the following 
Bernstein-type concentration inequalities: 

1 ■ log P ( [rfi -Vn](fn)> £ + X) > ^ ( Ua n + V + A f 2r n + * 



iv ° v " 1W " _ JV 7 - 2 VV Viv/ V 3 

and 

~ logP([^ " *J(/») > £ + A) > y ( + ^) 2 + 2r n A X ' 

In terms of the random fields , the first concentration inequality stated 
in Corollary 4.1 takes the following form: 



logP(K v (/„,)>-= + A 



~ 2 {{ b - an+ ViV J + VN\ n+ 3 
A 2 



N-+oo 2(b* a n ) 2 ' 
This proves the asymptotic estimate presented in (1.14). 
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Last, but not least, without further work, Theorem 1.2 leads to uni- 
form concentration inequalities for mean field particle interpretations of 
Feynman-Kac semigroups. 

Corollary 4.2. In the context of Feynman-Kac models, under the 
mixing type condition (M) m introduced in (4-1), for any N > 1, any n>0 
and any x > the probability of each of the following pair of events: 

[rfi ~ Vn](fn) < ^3,lH(l + £ - 1 (x)) 



85 m , s 2 -if X \ 

e m \ia 2 T7J 2 ,2{rn)N J 



and 



[Vn ~ VnKfn) < ^tZ7 3| l(m)(l + S^(x)) + 



2w2, 2 {m)x 



N 

is greater than 1 — e~ x . 



5. A stochastic perturbation analysis. 

5.1. Proof of the functional central limit theorem. 



Definition 5.1. We say that a collection of Markov transitions K„ from 
a measurable space (E, S) into another (F, F) satisfies condition (K) as soon 
as the following Lipschitz-type inequality is met for every / G Osci(-F): 

(5.1) (K) \\[K,,-K v ](f)\\<J\(»-r,)(h)\Tf(f,dh). 

In the above display, stands for some collection of bounded integral 
operators from B(F) into B(E) such that 

(5.2) sup [ osc(h)T?(f,dh) <osc(f)5(T K ), 

for some finite constant <5(T*) < oo. In the special case where K v {x,dy) = 
&(rj)(dy), for some mapping $ : r\ G V{E) \— > $(77) £ V(F), condition (5.1) is 
a simple Lipschitz-type condition on the mapping In this situation, we 
denote by (<3?) the corresponding condition; and whenever it is met, we says 
that the mapping <3? satisfy condition (<3?). 

We further assume that we are given a collection of McKean transitions 
K n ^ satisfying the weak Lipschitz-type condition stated in (5.1). In this 
situation, we already mention that the corresponding one-step mappings 
<& n {ji) = r]K n)r j, and the corresponding semigroup $ Pj „ satisfies condition 
($p,n) for some collection of bounded integral operators T* p '". 
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In the context of Feynman-Kac-type mdels, it is not difficult to check that 
condition (3> n ) is equivalent to the fact that the McKean transitions K n ^ 
given in (2.2) satisfy the Lipschitz condition (5.1). The latter is also met 
for the Gaussian transitions introduced in (2.3) as soon as the drift function 
d(x, 77) is sufficiently regular. As before, this condition is met for the Gaussian 
transitions introduced in (2.3) and for the McKean-type model of gases (2.5) 
presented in Section 2.3. For a more detailed discussion on these stability 
properties, we refer the reader to the Appendix, on page 23. 

Notice that the centered random fields introduced in (1.8) have con- 
ditional variance functions given by 

(5.3) E(^(/ B ) 2 |j£0 = ^ 

Using Kintchine's inequality, for every / € Osci (E n ), N > 1 and any n > 
and m > 1 we have the L2 m almost sure estimates 

(5.4) E(|Wf (/ n )| 2m | F { n N _\) ll{2m) < b(2m) with b(2m) 2m := 2^ m [2m)\ / m\ . 
We can also prove the following theorem. 

Theorem 5.2. The sequence (W J f r ) n >o converges in law, as N tends to 
infinity, to the sequence of n independent, Gaussian and centered random 
fields (W n ) n >Q described in Theorem 1.1. 

The proof of this theorem follows the same line of arguments as those we 
used in [9] in the context of Feynman-Kac models. For completeness, and 
for the convenience of the reader, the complete proof of this result is housed 
in Appendix A. 2. 

Let us examine some direct consequences of this result. Combining the 
Lipschitz property ($ p , n ) of the semigroup & p . n with the decomposition 

n 

[rfi ~ Tin] = X> P ,n(^) " <W<M^_ 1 ))], 
p=0 

we find that 

n r. 

^N\lV%-Vn](fn)\=J2 / \W p N (h)\T^ Ni) (f,dh). 

p=o j P p 1 

In the above displayed formulae, we have used the convention $0(^-1) = VOi 
for p = 0. From the previous almost sure estimates, we readily conclude 
that 

n 

sup VNE(\[r,% - Vn](fn)\ 2m ) 1/(2m) < b{2m)Yj{T^). 
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We are now in position to prove the fluctuation Theorem 1.1. Using the 
decomposition 

Vn = W» + V»_ x V n + y/NR*»(r,%_i,r,n-i), 
we readily prove that 



(5-5) V n N = Y,W p N V p , n + ^=K», 

with the remainder second-order measure 

n-1 

p=0 

In the above display, V p>n = £> p +i, . . . ,U n -iV n stands for the semigroup 
associated with the integral operators V n := D rin _ 1 Q n , with the usual con- 
vention T> n ^n = Id, for p = n. Using a first-order derivation formula for the 
semigroup <& pn (cf., e.g., Lemma A.l on page 24), it is readily checked that 

D rip %,n = (D Vp % +1 )(D np+1 % +lin ) = V p+1 {D Vp <S> P:n ) = V p>n . 

Using the fact that 

n—l 

l^(/n)l<E/ \(v P N r 2 (9)\< P+1 (f>dg), 

p=0 J 

we conclude that, for any m > 1, we have 

n—l / p \ 2 

p=0 \q=0 J 

This clearly implies that -^=TZ^ converge in law to the null measure, in the 

sense that ^=7£^ (f n ) converge in law to zero, for any bounded test function 

f n on E n . Using the fact that converges in law to the sequence of n 
independent, random fields W n , the proposition is now a direct consequence 
of the decomposition formula (5.5). This ends the proof of Theorem 1.1. 

5.2. A concentration lemma for triangular arrays. For every n > and 
N > 1, we let Xn := (X^ )i<i<N be a triangular array of random vari- 
ables defined on some filtered probability space (f^J^) associated with 

a collection of increasing cr-fields {F^) u >q. We assume that (X n N ' )i<?<jv 
are J 7 ^ ^conditionally independent and centered random variables. Suppose 
furthermore that 

Vn>0 a n <X^<b n and E((X^' i) ) 2 |^_ 1 ) < c 2 n 
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for some collection of finite constants (a n , b n , c n ), with the convention J 7 ^ = 
{0, 0} for n = 0. For any n > 0, let 

N 

:=S» + < where AS* := S% - = £ 

i=l 

and is a random perturbation term such that 

Vm>l E(|i^n lAn < b(2m) 2 d n 

for some finite constant d n . We use the convention 5^ = 0, for n = 0. We 
set 



with the middle point 



2 
V 

p=0 p=0 



Lemma 5.3. For any A > 1 and any n > 0, i/ie probability of each of 
the following pair of events 

(5.6) T% < d n (l + e^(x)) + m*b*e? (j^j 
and 

(5.7) <d n {l + e^(x))+K^N 
is greater than 1 — e~ x , for any x>0. 

Remark 5.4. Notice that (5.7) gives always a better concentration in- 
equality when ^2p =0 Cp > J2p=o^p- ^ n * ne °PP os ite situation, if Ylp=o c p < 
^p=o^p; inequality (5.6) gives better concentration estimates for sufficiently 
small values of the precision parameter x. 

Before getting into the details of the proof of the above lemma, we examine 
some direct consequences of these inequalities based on Legendre-Fenchel 
transforms estimates developed in Appendix A. 6. First, combining (A. 16) 
with (A. 15) we observe that, with probability greater than 1 — e~ x , 

< d n (l + 2^ + 9 (x)) + o* (cnVNy/te + Ncfa (jjL^j ) 

with the pair of functions 

„ / x , / r- s r- log(l + 2v/i + 2x))-2^/x 
(x) := 2x + log(l + 2^ + 2s) - 2^ + &v v J.' — < 2x 

2x + 2y/X 
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and 

, lW;= VS+(4x/3) , 
U ' log(l + (x/3) + " 3 

The upper bounds given above together with (A. 8) imply that, with proba- 
bility greater than 1 — e~ x , 



If <d n + A n x+^j2xBN, 

where 

A n :=(2d n +^ and := (V2d n + &*c n ViV) 2 . 

Using these successive upper bounds, we arrive at the following Bernstein- 
type inequality: 



In much the same way, starting from (5.6), we have, with probability 
greater than 1 — e~ x , 



(5.9) Tf < d n (l + 2(x + y/x)) + 5 n \/2^iV = d n + A n x + ^]2xB^ 
with the pair of constants 

A n :=2d n and 5^ := (V2d n + 5 n VN) 2 . 

Using these successive upper bounds, we arrive at the following Bernstein- 
type inequality: 

(5. 10) 4'Wf4 + ^y((^§) !+ M » A 



Proof of Lemma 5.3. First, we observe that 

\2m 



Mt G [0, l/(2d n ) [ E(e"tf ) < V ^1^6(2m) 

z — / m 



m! 

m>0 



To obtain a more explicit form of the r.h.s. term, we recall that b(2m) 2m 
K(X 2m ) with a Gaussian centered random variable with K(X 2 ) = 1 and 

W e [0, l/2[ E(exp{dX 2 }) = £ S —b(2mf m = -=L= 

From this observation, we readily find that 

Vt G [0, l/(2d n )[ L^ n (t) := logE^" "^) < a , n {t) := a (td n ). 
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Using (A. 7), we obtain the following almost sure inequality: 

logE(e tAS "V^-i) < iv(g) 2 «i(M). 

It implies that 

n / \ 2 

Vt>0 L^ n (t):=logE(e tS ^)<N^2 (?) «i(M) < 

p=0 \ 

with the increasing and convex function a^ n (t) = Nc^aiib^t). 

Using (A. 8), we now obtain the following Cramer-Chernoff estimate: 

(5.11) V* > F(S% + R^>r n + (L^Y\x) + (L^y\x)) < e~*. 

In other words, the probability that 

SZ + RZ <r n + (L^)- 1 (x) + (L^)~ 1 (x) 

is greater than 1 — e~ x , which, together with the homogeneity properties 
of the inverses of Legendre-Fenchel transforms recalled in Appendix A. 6, 
gives (5.6). 

The proof of (5.7) is based on Hoeffding's inequality, 

81ogE(e fX " iV ' l) \T%_ X ) < t 2 (b n - a n f. 

From these estimates, we readily find that L Xn (t) < a^ n (t) := NS^t 2 /2. Ar- 
guing as before, we find that 

(L?*)- 1 (x)<(ag*)- 1 (x) = y/2xN3*. 

We end the proof of the second assertion using (5.11). This ends the proof 
of the lemma. □ 

5.3. Concentration properties of mean field models. This section is con- 
cerned with the proof of Theorem 1.2. To simplify the presentation, we set 

2 >w? : = 2 >*,«_ 1 )*p.« and K p , n = Tl^. 
Under our assumptions, we have the almost sure estimates 
P(V$) < P(V$ Ptn ) := sup P(V v $ Ptn ). 

In this notation, one important consequence of the above lemma is the fol- 
lowing decomposition: 

n 
p=0 
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with the pair of random measures , J^) given by 

n n 

^:=£< P ff and J» := VN^pA^ , <M<i))- 

p=0 p=0 

In what follows / n stands for some test function f n € Osci(.E n ). Combin- 
ing (5.4) with the generalized Minkowski integral inequality we find that 

from which we readily conclude that 



iVE(|^ n (r ?p Ar ,cl» p (r ? ^ 1 ))(/ n ,)nj-^) 1 / m < b(2m) 2 5(R^), 



E(\VNJ»(f n )\ m ) 1/m = NE[ 



p=0 



m\ l/m 



<b{2m) 2 ^5{R^). 

p=0 

Notice that 

n N 

^=EE*''(/n) where X$<>(f n ) = U^p™ (/„)), 

p=0 t=l 

and the random measures Up N,% ^ are given, for any g p £ Osci(£^), by 

U^Hdp) :=5p(^ ) )-^ )< _ 1 fe)(^-i i) )- 

In the further development of this section, we fix the final time horizon n 
and the the function f n £ Osci(E n ). To clarify the presentation, we omit the 
final time index and the test function f n , and we set, for any p in [0,n], 

p N 
q=0 i=l 

and 

k=0 

At the final time horizon, we have 

p = n S% = VNI% and R% = Vn,J*. 

By construction, these variables form a triangular array of J r J ^ 1 -condition- 
ally independent random variables and 

E((A^) 2 |J^i) = 0. 
In addition, we readily check the following almost sure estimates: 

\X( N ti\</3(V$ p , n ) and Ei^) 2 ^) 1 / 2 < ^(D$ P) „) 
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for any < p < n. The proof of the theorem is now a direct consequence of 
Lemma 5.3. 

APPENDIX 
A.l. A first-order composition lemma. 

Lemma A. 1. For any pair of mappings <3?i G T(Eq, E\) and $2 G E2) 
the composition mapping ($2 $1) £ Y(Eo,E2) and we have the first-order 
derivation-type formula 

(A.l) V v (<S> 2 o<S> 1 ) = V ri <S> 1 V< i>l{r]) <5> 2 . 

Proof. To check this property, we first observe that under this condi- 
tion, we clearly have the Lipschitz property, 

(*) |[*(M)-*fa)](/)l< / \(»-v)(h)\T*(f,dh), 

for some collection of integral operators T* from B(F) into the set Osci(E') 
such that 

(A.2) sup foBc(h)T?(J,dh)<oBc(J)6(T*) 

for some finite constant 5(T*) < 00. Using this property, we easily check 
that (A.l) is met with 

/8(P($2 o $1)) < /8(Z>* 2 )i8(2>*i) 

and 

This ends the proof of the lemma. □ 

We also mention that for any pair of mappings <E>i : 77 E V(Eq) i— >■ ^ £ 
"P(i5i) and $2 :?7 G ^(-^l) $1 £ V(E 2 ), the composition mapping $ = $ 2 ° 
$1 satisfies condition (<3?) as soon as this condition is met for each mapping. 
In this case, we also notice that 

$(T*a°*i)<$(T* 2 ) x 5(T^). 

Suppose we are given a mapping defined in terms of a nonlinear transport 
formula 

$(77) = 77/^, 

with a collection of Markov transitions from a measurable space (E, £) 
into another (F,J-) satisfying condition (K). Using the decomposition 

- $(??) = [r? - + ^ - 
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we readily check that 

(K) => ($) with T*(f,dh) = 5 KvU) {dh) + Tf(f,dh). 

A.2. Proof of Theorem 5.2. Let F N = {F^ ;n > 0} be the natural filtra- 
tion associated with the iV-particle system The first class of martin- 
gales that arises naturally in our context is the M. d - valued and .F^-martingale 
M?(f) defined by 

n 

(A.3) M»(f) = X>£U) " 

p=0 

where f p :x p € E p \-^ f P {x p ) = (f£ (Xp))it=i,...,d S K rf is a (f-dimensional and 
bounded measurable function. By direct inspection, we see that the t> th com- 
ponent of the martingale (/) = (M^ (f u )) u =i ... d is the (i-dimensional 
and F^-martingale defined for any u = 1, . . . , d by the formula 

n 
p=0 

with the usual convention J<" ^jv = ??o = ^o^-i) for p = 0. The idea of the 
proof consists of using the CLT for triangular arrays of Revalued random 
variables ([18], Theorem 3.33, page 437). We first rewrite the martingale 
vNM^if) in the following form: 



N n 




This readily yields y/NM^{f) = Y/k=i U k U) wh ere for any 1 < k < 
(n + 1)N with k = pN + i for some i = 1, . . . , N and p = 0, . . . , n 

U?(f) = -^(/ p (£f < l) ) - K p ^Jf p )(^ N _f)). 

We further denote by the cr-algebra generated by the random variables 
£p for any pair index (j,p) such that pN + j < k. It can be checked that, for 
any 1 < u < v < d and for any 1 < k < (n + 1)N with k = pN + i for some 
i = 1, . . . , N and p = 0, . . . , n, we have E(Uj? (f u )\Gk-i) = and 
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This also yields that 

pN+N 

E(u?(nuF(r)\?Z-i) 

k=pN+l 

= Vp-llKp,^ Kfp - K p,v%_JpW ~ K p,r$_Jp)\\- 

Our aim is now to describe the limiting behavior of the martingale yN M^{f) 

in terms of the process X^(f) d =' Y^k=i +N (f). By the definition of the 
particle model associated with a given mapping <I> n , and using the fact that 
[^TV 1 = M' one & e * s that for any 1 <u,v < d 
[Nt]+N 

k=l 

[Nt]-N[t] 

— u [t]\J >J )~\ ]y l°[t]+iU >J J _L/ [i]U >/ ))> 

where, for any n > and 1 < u, v < d, 

n 
p=0 

Under our regularity conditions on the McKean transitions, this implies that 
for any l<i,j<d, 

[Nt]+N 

Eiu^nu^niTU-^Ctirj"), 

*■ — ' TV— >oo 

k=l 

with 

n 

c n (r,n = E^-i[^-i((/p - K p,v P -Jp)(fp - K P ^j v p ))\ 

and, for any t € R+, 

ct(r,n=c [ t ] {r,n+{tKc® +1 (r,r)-c [t ](r,n)- 

Since \\Ug (/)|| < ^(V P <„ for any 1 < k < [Nt] + N, the conditional 

Lindeberg condition is clearly satisfied, and therefore one concludes that 
the M. d - valued martingale {X^(f);t G M+} converges in law to a continuous 
Gaussian martingale {Xt(f);t 6 K+} such that, for any 1 < u,v < d and 
t G M+, {X(f%X(r)) t = C t (f u ,r). Recalling that = y/NMj*(f), 

we conclude that the M. d - valued and .F^-martingale y/NM^ (f) converges 
in law to an Revalued and Gaussian martingale M n (f) = (M n (f u )) u= i ... d 
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such that for any n > and 1 < it, v < d 

n 

(M(n,M(r)) n = Y,V P -i[K P , Vp ^((fp - K PtVp _j;)(f; - K p ^j;))\, 

p=0 

with the convention Ko„ t = rjQ for p = 0. 

To take the final step, we let (</?n)n>o be a sequence of bounded mea- 
surable functions, respectively, in B{E n ) dn . We associate with tp = (ip n ) n 
the sequence of functions / = (fp)o<p<n defined for any < p < n by the 
following formula: 



/„ = (/ p U )n=0,...,n = (0, . . . , 0, ip p , 0, . . . , 0) E B{E } 



\doH \-d 



V) 



p 



In the above display, stands for the null function in B{E p ) dq (for q^p). 
By construction, we have, = <p u and for any < u < n, we have that 

/" = (fp)o<p<n 

= (0,. .. ,0,^,0,. ..,0) 

E B(E ) d ° x • • • x B(E u ) du x • • • x B(E n ) dn 

so that 



''u—i 
and therefore 



NM« (/) := (VNM£ (D)o< u < n = (Kf W)o<«<n == < V (p). 

We conclude that (<p) converges in law to an (n + l)-dimensional and 
centered Gaussian random field V n ((p) = (V u ((p u ))o< u < n with, for any < 

u, v < n, 

E(K(^)K(^)) 

This ends the proof of the theorem. 

A. 3. Feynman-Kac semigroups. In the context of Feynman-Kac flows 
(2.1) discussed in the Introduction, the semigroup & P) n is given by the fol- 
lowing formula: 

Th(f) = ^(Q^"^)) With Qp> n = Qp+1> • • • ' Qn-lQn- 

For p = n, we use the convention Q n . n = Id, the identity operator. Also 
observe that 

[$pM " %,n(v)](f) = -77T ~ »j)A,*p,n(/) , 

h L \S J p,n,ri) 

with the first-order operator 

A,* Pl n(/) := G p>n>r) P Ptn (f - 9p, n (v)(f)). 
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In the above display G p>njTj and P p ^ n stand for the potential function and 
the Markov operator given by 

G p>n>v :=Q p>n (l)/r](Qp jn (l)) and P p>n (f) = Qp,n{f) / Qp,n(X) • 
It is now easy to check that 

rc**»0i, »/)(/) :=- J > -q]® 2 (G P ,n,q®iW/))- 
Using the fact that 

D v $ p>n (f)(x) = G Pirl!V (x) / [P Pin (f)(x) ~ Pp,n{f)(y)}G p , n ,r,{y)r](dy), 



we find that 

V/ e Osci(^ n ) ||I>„*p in (/)|| < q P ,nP(P Pl n) 

with 

Q pn (l)(x) 

q P ,n = SUp- TTTT-r. 

1,1/ Qp,n(l)(y) 

This implies that 

P(D$ p>n ) < 2q Ptn P(P p>n ). 

Finally, we observe that 

i®2 f Gp,n,r/ Dp,n,ri(f) 



\K* 9 ' n M(f)\<iH,nP( D P*)) 



[fj, - rjf 



2q P)Tl P(D p , 



from which one concludes that 

6(R*'-») < 2qlJ{D^ n ) < Aqlj{Pp, n ). 

We end this section with the analysis of these quantities for the time 
homogeneous models discussed in (4.1) and (4.2). Under the condition (M) m 
we have for any n > m > 1 and p > 1 , 

(A.4) qp, p+n <S m /e m and P(P p , p+n ) < (1 - £^/<5 m _i)^ . 

The proof of these estimates relies on semigroup techniques (see [9], Chap- 
ter 4, for details). Several contraction inequalities can be deduced from these 
results, given below. 

For any k > and for I = 1, 2, 

(a.5) ±4 M < < : = 1 _ ( ^ ( !^;£_ i))r 

Notice that 

tJ7 fc) ,(m) < mi m _i 7 m < m5 m -i^/d +2 , 

(2- {£m/ S m-l)r 
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and that 

r n < 4tz7 3i i(m) and 6* < 2S m /e m 

as well as 

o"n < 4ti72,2("i)o' 2 and /3 n < 4tz7 2) 2(m) with cr 2 := sup er„(<l). 

n>l 

A.4. McKean mean field model of gases. We consider McKean-type mod- 
els of gases (2.5) presented in Section 2.3. To simplify the presentation, we 
consider time homogeneous models, and we supress the time index. In this 
notation, we find that 

[K v -K fl ](f)(x) = J v(ds)[r)-»](a(s r ))M(f)(s,x). 

Observe that 

[r l -n}(K v -K fl )(f)(x) = j v(ds)[ V - v](a(s,-))[r,- v](M(f)(s,-)). 
Using the decomposition 

(A.6) d>(r?) - $(//) = (r, - n)K„ + ii{K n - + [ V - ^](K V - K^) 
we readily check that $ € T(E,E) with the first-order operator 
D lt *(f)(x) = [K li (f){x)-*(ji)(f)] 



+ J u{ds)[a{s,x) - fx(a(s,-))MM(f)(s r )) 
and the second-order remainder measure 

K*Mif) = J[v-^f 2 (9sHds) with g s = a(s,-)®M(f)(s,-). 
In this situation, we notice that 

P(p$)<P(M) 1 + J u(ds)osc(a(s,-)) 

and 

<5(i2*) < p{M) [ u(ds)osc(a(s,-)). 



A. 5. Gaussian semigroups. To simplify the presentation, we only dis- 
cuss time homogenous and one-dimensional models. We consider the one- 
dimensional gaussian transitions on E = M defined below: 

K v (x,dy) = -J=expl-^(y- d(x,r])) 2 ^dy 

with some linear drift function d n of the form d(x, rf) = a{x) + ij(b)c(x), with 
some measurable (and nonnecessarily bounded) function a, and some pair 
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of functions b and c 6 B(R). We use the decomposition 

[K 1J -K fl ](f)(x) = [ K fl (x,dy)@(A fl , v (x,y))f(y) 



+ J K t i(x,dy)A^ r] (x,y)f(y) 
with Q(u) = e u — 1 — u and the function A fJitV (x,y) defined by 
A^(x,y) = log ^f^j(y) = Ag) (*,„) + Ag) (x,y) 

with 

Aj$(z,i/) : = - d(a?,/i)][l/ - d(a5,A*)] = c ( x )(v ~ t*)(P)[y - d(x,fi)], 

Agi(x,y) := -±[d(x, V ) - d(x,v)} 2 = -±c(x) 2 [( V - ^)(b)} 2 . 

Under our assumptions on the drift function d, we have 

|AW(x,|/)| < ||c||osc(6)|y-d(a;,At)| and \A^(x,y)\ < \\c\\ 2 osc(6) 2 /2. 

Using the fact that |Q(u)| < eH« 2 /2, after some elementary manipulations 
we prove that 



sup 



[K v - K^](f)(x) - J K, l (x,dy)AW(x,y)f(y) 



<C7[(7/- M )(6)] : 



with some finite constant C < oo whose values only depend on ||c|| and 
osc(6). On the other hand, we have 

J(r,-H)(dx) J K^x,dy)AW(x,y)f(y) = ( V -^ 2 (b^(K^f))) 

and 

J M (dx) | ^(x,dy)Ag) ? (x, 2 /)/(y) = (r ? - M )(%( J ft:;(/)) 
with the bounded integral operator K'^ defined by 

Using the decomposition (A. 6) we prove that 

$(r?)(/ - $(//)(/)) = (r? - + ft* (r?, //)(/), 

with the first-order operator 

= K,(f - $( M )(/)) + 
and a second-order remainder term such that 

|tt*07,/*)(/)| < - /if 2 (6® «(/)))| + [(»/- ^)(6)] 2 osc(/)] 
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with some finite constant C' < oo whose values only depend on ||c|| and 
osc(6). Using the fact that 

^(1) = and ||iir;(/)|| = ||^(/-*(M)(/))||<||c||osc(/), 

we conclude that (3.3) and (3.4) are met with S(R^) < C osc(6)(2||c|| + 
osc(6)), and condition (3.2) is satisfied with /?(£>$) < 1 + ||c|| osc(6). 

A. 6. Legendre transform and convex analysis. We associate with any 
increasing and convex function L:t£ Dom(L) i— > L(t) € M + defined in some 
domain Dom(L) C M+, with L(0) = 0, the Legendre-Fenchel transform L* 
defined by the variational formula 

VA>0 L*(A):= sup (Xt-L(t)). 

tSDom(L) 

Note that L* is a convex increasing function with L*(0) = and its inverse 
(L*) _1 is a concave increasing function [with (L*)~ 1 (0) = 0]. 

For instance, the Legendre-Fenchel transforms (ckq, a*) of the pair of con- 
vex nonnegative functions (ao,ai) given below: 

Vie[0,l/2[ ao(t) :=-*- |log(l-2t) 

and 

Vt>0 ai(t):=e*-l-t 

are simply given by 

og(A) = i(A-bg(l + A)) and a*(A) = (1 + A) log (1 + A) - A. 

Recall that, for any centered random variable Y with values in ]— oo,l] 
such that E(y 2 ) < v, we have 

(A.7) E(e tY ) < Ve +e < 1 + vaUt) < expOoaUt)). 

1 + 1? 

We refer to [6] for a proof of (A.7) and for more precise results. For any pair 
of such functions (L\,L2), it is readily checked that 

VteDom(L 2 ) L 1 (t)<L 2 (t) and Dom(L 2 ) C Dom(Li) 

L\<L\ and (L*)' 1 < (L*)" 1 . 
For any pair of positive numbers (u,v), We also have that 
Vt G t> _1 Dom(L 2 ) L 1 (t)=uL 2 {vt) 

VA>0 L^(A) = uL^( — ) and Vx > (Lj) _1 (aj) = uu(L]j[)- 
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As a simple consequence of the latter results, let us quote the following 
property that will be used later in the further development of Section 5.3: 



u<u and v < v => uv{L\ 



Here we want to give upper bounds on the inverse functions of the Legen- 
dre transforms. Our motivation is due to the following result, which avoids 
the loss of a factor 2 when adding exponential inequalities. Let A and B 
be centered random variables with finite log-Laplace transform, which we 
denote by a a and ag, in a neighborhood of 0. Then, denoting by oa+b the 
log-Laplace transform of A + B, 

(A.8) (aft+B)- 1 ^) < (aft- 1 ® + (afc)"^) 

for any positive t (see [25], Lemma 2.1). 

In order to obtain analytic approximations of these inverse functions, one 
can use the Newton algorithm: let 

. x — a*(z) 

f{z) = z+ y , 

and define the sequence (z n ) by z n = F{z n -i). From the properties of the 
Legendre-Fenchel transform, we also have that 

' a{{a')- l (z))+x s 



(A.9) FM-, (o0 _ 1(z) 

Now recall the variational formulation of the inverse of the Legendre-Fenchel 
transform, 

(A.10) (a*) -1 0r) = jnf £~V(*) + x), 

valid for any x > (see [26], page 159 for a proof of this formula). From this 
formula, assuming that a"(0) > and setting z = a'(t), we get that 

(A.11) («TV)= ,mf F(z). 

Let then f(z) = a((a')^ 1 (z)) +x and g(z) = (a')~ 1 (z). From the strict con- 
vexity of a, the function t — > t~ 1 ((a(t) + x) a unique minimum t x and is 
decreasing with negative derivative for t <t x , increasing with positive deriva- 
tive for t > t x . It follows that f/g has a unique critical point z(x), which 
is the unique global strict minimum of F and the unique fixed point of F. 
Furthermore z(x) = (a*) _1 (x). 

Let zq > be in the interior of the image by a' of the domain of a. 
If zq > z(x), then (z n ) is a decreasing sequence of numbers bounded from 
below by z(x). Hence (z n ) decreases to z(x) as n tends to oo. If zq < z(x) 
and F(zo) belongs to the interior of a'(Dom(a)), then z\ > z(x) and (z n )n>o 
is decreasing to z(x). 
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We now recall the convergence properties of the Newton algorithm. As- 
sume that zq > z(x) and let A be a positive real such that F"(z) < 2 A for 
any z in [z(aj), 2q]. Then, by the Taylor formula at order 2, 

(A.12) < z n - z{x) < A^-^zq - z(x)) (2?l) , 

which provides a supergeometric rate of convergence if A(zq — z(x)) < 1. 

Since F depends on x, A is a function of x. In order to get estimates of 
the rate of convergence of z n to z(x) for small values of x, we now assume 
that a' is convex. We will prove that 

(A.13) A:= \ sup F"(z)<^^. 

To prove (A.13), we start by computing F" = (f j g)" . Since /' = zg' , 

(f/g)'=g'(zg-f)g- 2 . 

Now (zg -f)' = g+ (zg' - /') = g. It follows that 

(f/g)" = go" 1 + (zg - f)(g"g~ 2 - 2</V 3 )- 

Next, for z > z(x), zg(z) — f(z) > 0, so that 

(f/g)"(z)<g'g- 1 + (zg-f)g"g- 2 . 

Under the additional assumption that a' is convex, the inverse function 
(q') _1 = g is concave, so that g" < 0. In that case, for z > z(x), 

(f I g)" (z) <g'(z)/g(z) = (log g)'(z). 

Now log g is the inverse function of ip(t) = a'(e t ). From the properties of a' , 
the function tp is convex, so that log g is concave. Hence (log g) 1 is nonin- 
creasing, which implies that 

F"(z) < g' (z(x)) j g(z(x)) = z(x)g' (z(x)) / f (z(x)) for any z > z(x). 

Since f(z) > x and g'(z(x)) < g'(0) = l/a"(0), we get (A.13), noticing that 
z(x) = F(z(x)) = (a*)~ l (x). 

We now apply these results to the functions «o an d ct\. Using the fact that 

t 2 t 2 

— < ai (t) ■- e * - 1 - 1 < ai(t) := — -- 

2 ~ w - v ' 2(1 -t/3) 

for every t € [0,3[, and applying (B.5), page 153 in [26], we get that 

V2x < K)" 1 ^) < (c^y 1 ^) = V2x~ + (x/3). 

Also, by the second part of Theorem B.2 in [26], the function which is 
the inverse function of the above function, satisfies 



(A.14) 5j(t) > 



t 2 



2(1 + (t/3))' 
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which is the usual bound in the Bernstein inequality. Now z = e — 1, and 
consequently t = log(l + z) and 

x + z — log(l + z) 



F(z) 



log(l + z) 



Set zq = \j2x + (x/3). Then zq > z(x). Hence z(x) < z\ < zq (here z\ 
F(z )). So 

N / + (4x/3) - log(l + (x/3) + y^x) 



(at) _i (x) < zi := 

^ 1J K '~ log(l + (x/3) + V2x) 

(A.15) 

< (x/3) + V2x. 

Furthermore, from (A. 12) and (A. 13) and the fact that zq — z(x) < x/3, 
0<zi-K)^ 1 (x)<^(^)' 1 (x), 

which ensures that 

l&si/(18 + x) < (ai) _1 (x) < z\. 
In the same way, noticing that 

t 2 /(l - At/ 3) < ao(t) < t 2 /(l - 2t) for any t £ [0, l/2[, 

we get 

2y^+ (4x/3) < (aj$) _1 (x) < 2^x + 2x := z . 

By definition of a , we have a' Q (t) = 2t/(l-2t). Let z = 2t/(l-2t). Then 
t = z/(2 + 2z), so that 

= x + MW-Ht)) =2x+ (1 + z) + 2x + log(l + z)-z _ 

K) (*) 2 

Computing z\ = F(zq), we get 

K)~V) < z x := 2x + log(l + 2x + 2^x) 

/ A 161 , log(l + 2x + 2yi)-2y^ 

1 ' ' 2x + 2 v / x 

<2x + 2y/x, 

which improves on the previous upper bound. Furthermore, from (A. 12) 
and (A. 13) 

0<z 1 -(a* o y 1 (x)<^(a* o r 1 (x), 

which ensures that 

9zi/(9 + x)<(q^)^ 1 (x)<zi. 
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